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ABSTRACT 

Oevolopaients of efficient and accurate automated 
Information extraction procedures for analysis of 
multitemporal Landsat data In non-U. S. crop In- 
ventory and monitoring can provide a greatly Im- 
proved capability for practical and affordable 
use on a global basis without requiring ground 
observat1(W)s. 


I. INTRODUCTION 

At the launch of Thematic Mapper, research 
In satellite-aided crop Inventory and monitoring 
of global production continues to make advances 
toward practical, viable capabilities and im- 
provement in current procedures. These capabl- 
lltleSt when added to current agricultural Infor- 
mation systems, are expected to provide more 
timely and more accurate non-U. S. crop Informa- 
tion than Is now available. 

While the overall objective of this re- 
search^ Is to develop technology for extracting 
agricultural Information of various kinds, the 
focus Is on Improved non-U.S* production forecast 
technology which will be evaluated by the U.$. 
Department of Agriculture (USOA) for possible 
Integration Into its information systems. More 
specifically, the objective is to develop pro- 
cedures for using aerospace remote sensing and 
related technology to provide more objective and 
reliable crop area forecasts at several times 
during the growing season and with Improved pre- 
harvest estimates for selected countries and 
crops. 

We have adopted four categories of factors 
that reflect key characteristics of technology 
and which guide us In the research. These are 
timeliness* affordability, general applicability, 
and accuracy. Timeliness emphasizes the quick 
extraction of Information. Timeliness Is associ- 
ated with early-season estimates as well as esti- 
mates made thorughout the season. Affordability 
reflects efficiency and inexpensiveness. General 


applicability refers to techniques that are ap- 
plicable for forel^ crop regions, while remain- 
ing objective and Improvable procedures. Accu- 
racy reflects the degree of bias and variance 
over time and the responsiveness to factors 
affecting departures from average. 


li. TECHNICAL APPROACH 

The technical approach Is to provide 
satellite-based, objective estimates of area, 
yield, and pro<hict1on as one Input Into a compre- 
hensive, wltldata-source Information system. 

The conceptual fr am ew o rk Involves estimating crop 
area and yield for specified regions and multi- 
plying them together to obtain promotion at the 
regional level. Automatic data processing 
approaches are considered necessary to provide 
objective, timely, and reliable estimates. Our 
research has been focused on the area estlmatlc^ 
component; hence, further discussion will be pri- 
marily oriented to that component. Area esti- 
mates for regions are derived by processing s a- 
tUtlcal samples (called segments) of satellite 
digitlal Image data (Landsat MSS). The desire Is 
to estimate crop area periodically throughout the 
season, from the time of planting through 
harvest. 

Current approaches In crop area estimation 
extensively utilize statistical sample survey 
methodology. Sampling methodology allows reli- 
able estimates to be made by processing only a 
very small portion of the data* Expansion of the 
sample estimates to a regional estimate Is refer- 
red to as aggregation and 1$ a statistical pro- 
cess that Is made more complex by non-response 
(generally because of Intervening cloud cover) 
and partial response. 

Since non-U.S. ground data may be available 
In limited quantities for research studies but 
would not be available to support an nr^ratlonal 
system, and because of efficiency and timeliness 
considerations, our approach to non-U.S. crop 
forecasting does not require ground observations. 
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Tht variability In the crop scene envlron- 
laent amng crop regions and countries Is quite 
large. It Is also strongly desirable to Minimize 
huaan Interaction. Hence» recent approachef have 
concentrated on the development of robust proce- 
(kires that are largely self-adaptive. In terms of 
recoil zing crops based on their spectral signa- 
tures over time and distance. Some adaptation to 
specific crop regions is still necessary. The 
foremost challenge In satellite-oriented global 
crop foreccsting Is crop estimation without the 
ground observations that serve to train (adapt) 
remote sensing procedures which use them. 

Tecnnology development has been based on use 
of extensive ground observations obtained In the 
United States where tht reliability of the obser- 
vations Is understood. Regions In the United 
States that are similar to foreign crop regions 
of Interest have been selected as study areas. 
Since true analogue regions do not exist, these 
forel^ similarity regions In the Ibilted States 
are not completely suitable. Incremental testing 
over domains of greater variability of Indepen- 
dent data sets Is required. Such testing serves 
to establish levels of performance. Isolate the 
technology deficiencies, and suggest required 
avenues of resolution. Even so, testing on such 
data setr Is limited to the variability of avail- 
able characteristics. However, by using the 
available data to establish the sensitivity of 
the methodology to these parameters, simulation 
techniques can be employed to establish the per- 
formance over a wider variety of conditions* 


III. DESCRIPTION OF EXPERIMENTS 

The objective In our research experiments 
has been to develop and evaluate state-of-the-art 
technologies for estimating the crop area of 
spring small grains, summer crops, corn, and soy- 
beans, In order to obtain an understanding of 
where limitations in performance exist and so 
that RSD efforts can be focused there. These 
experiments represent the first "Independent" 
sub-system level sensitivity tests against the 
adopted performance criteria of timeliness, 
affordability, applicability, and accuracy. 

The technology developed for crop are 
estimation of spring small grains and that for 
summer crops, corn, and soybeans reflected two 
different stages of technical maturity. For 
spring small grains, a technical breakthrough has 
been achieved (figure 1). A hl^ly efficient 
state, froR. a man/machine systefss point of view, 
has been attained by modeling the interpretive 
intelligence of skilled and expert image analysts 
In an automated and objective form - simple, art- 
Iflcal Intelligence. Area estimation technology 
for the summer crops, corn, and soybeans, being 4 
years later 1r Initiation, was In a less mature 
stage of development. Thus, two separate experi- 
ments (one for spring small grains and one for 
summer crops, corn, and soybeans) were designed to 
optimize the evaluations of these technologies. 


A. SRRING SMALL GRAINS 

Area estimation technology available for 
evaluation proceed regional aggregations as well 
as component crop area proportion estimates 
within sample segments* 

The segisent proportion estimation component 
was represented two different technical 
approaches to the same basic Identification 
scheme of determining different spectral crop 
appearance development over time. Both ap- 
proaches exhibit an efficiency Improvement 
through automation of an order of magn1tu<te, when 
compared to previous analyst-intensive proceihires 
(figure 2). The first procedure (SSG3C) Is a 
completely automated modeling^ of the previous 
mamially Intensive, skilled and tifptrt analyst 
Interpretative procedure. The basic functions 
consist of: acquisition selection based on 
meteorological variables and anticipated Landsat 
spectral responses for spring small grains: a 
transform of Landsat data to a level of greenness 
observed; an automatic, multi temporal, pixel- 
labeling logic based on a hierarchical process; 
and a proportion estimator based on a systematic 
sample of the labeled pixels. SSG38 (s«ne 
technical approach as SSGX) which arows an 
analyst override of the automatic, acr.ulsltlon- 
selectlon module was also developed^ and 
evaluated In this experiment. 

The second proportion estlmatloo approach 
(SSG4) was also completely automatic^. The basic 
functional differences between this approach and 
that of the SSG3 were: the Landsat data are 

transformed by a model which "normalizes" color; 
a "field" finding algorithm groups the individual 
pixels Into quasi-fields; the quasi-fields are 
labeled by a multitemporal logic of "green/not- 
green" sequence; and the proportion estimate Is 
derived from an enumeration of the labeled fields 
plus an adjustment for estimated omission rates. 

A major objective of the experiment was to 
ascertain If the cost benefits of procedure 
automation could be achieved with comparable 
accuracy to the ariulyst-lntenslve procedures. 

The Landsat data used for the small grains 
experiment consisted of sample segments collected 
over a 4-year period (1976-1979) covering the 
U.S. Northern Great Plains (North Dakota, South 
Dakota, Montana, and Minnesota) and Saskatchewan, 
Canada (figure 3). Only segments that had ground 
observations for evaluation were analyzed, with 
the exception of 1978 samples of North Dakota In 
which all allocated Landsat segments were 
Included to support aggregation studies. 

The 1976 crop year In this region was warmer 
than average. The crop season was earlier than 
average, as was the case In 1977, even though the 
temperature was near average. The 1978 crop year 
was cooler than average, which gave normal to 
late planting. Late planting was experienced In 
1979 as well. 
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Figure 1»- Technical breakthrough (modeling of analyst function enabled automatic processing). 



Figure 2.- Improved efficiencies of automated information extraction process. 
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(A) 1t7« StTtS (VWATHtR: WARM; 
MASON: SARLVI 




Figure 3.- Site locations and meteorological summary for U.S. and Canada spring small grains* 


Ideally* the data set for this experiment 
would be completely representative of non-U.S. 
crop regions. While It Is not, this Is the most 
extensive data set ever used In testing. Inclu- 
ding about 9,000 square miles of ground observa- 
tions spread through four states and one province 
over 4 crop-years. 


8. SUmER CROPS 

The level of evaluation of summer crops, 
corn, and soybeans technology was more limited in 
both scope and purpose, due to Its less mature 
state. The results of this experiment would pro- 
vide Identification and quantification of the 
major subcomponent contributors to the proportion 
estimation error for future developmental modifi- 
cation prior to the development of an automatic 
processing approach. 

The summer :rops, corn, and soybeans pro- 
portion estimation technology (called CS-1) per- 
formed functions similar to the small grains pro- 
cedures, such as: Landsat data transformation 

and feature extraction; target definition and 
stratification; labeling; sampling; and propor- 
tion estimation. However, the design of the 
experimental procedure was structured and 
analyst-intensive, to allow the tabulation of 
results at each detailed procedural step for 
subsequent evaluation and performance analyses. 


The Landsat data for the summer crops, com, 
and soybeans experiment consisted of 18 sample 
segments from the 1978 crop year and 10 sample 
segments from the 1979 crop year. A large data 
base for evaluation, such as was available for 
small grains, does not yet exist for corn and 
soybeans. These segments were drawn from the 
U.S. Central Corn Belt states of Iowa, Illinois, 
Indiana, and Missouri. 


IV. RESULTS 


A. SPRING SMALL GRAINS EXPERIMENT^ 

North Dakota Aggregations . Evaluation of 
these procedures, developed specifically to esti- 
mate non-U. S. spring small grains area, were 
carried out over areas In the United States (and 
Canada) where reliable reference data were avail- 
able before attempting adaptation. The results 
of North Dakota aggregations for 1978 data are 
(percent relative error and coefficient of vari- 
ation): ^3.5 (C.V. ■ 4.4), ♦6.9 (C.V. ■ 4.3), 

and -9.2 (C.V. - 4.6) for SSG38, SS6X and SSfi4 
respectively. 

The performance Is expressed relative to the 
published USOA estimate of the spring grains 
acres harvested In North Dakota during 1978 
(13.12 million acres). 
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Su b$ystw L«v1 Resu lts. The SMple scgeant 
propol^ion estimitloA eccurecy pcrformnces ere 
shown In figure 4. The results shown Include ell 
segment enelyses Including mechlne errors, c1er1> 
cel errors, end otherwise outliers. (These un- 
edited results ere else Included in the eggrege- 
tlons.) The Intent wes to Identify end quentify 
all potential contributions to error, without 
regard to producing the best estimate. By not 
thresholding seemingly obvious outliers, we opti- 
mize the probability of Isolating the major sub- 
component error contributors. Even with this. It 
was somewhat encouraging that the mean absolute 
error In most cases was less then 10 percent. 
Also, the results when compered to the more labor 
Intensive historical procedures verify what was 
expected, I.e. , generally comparable with lower 
bias and somewhat larger variance. 

It Is of Interest to get an Indication of 
the possible performance of the technology for a 
key foreign region. Similar performance was ob- 
tained for segments within a previously deter- 
mined USSR Foreign Similarity Region (FSR). 

In general, the automatic procedures 
currently provide estimates for fewer segments 
(50 percent to 65 percent of total allocated) 


compared to the historical manual procedure 
(75 percent). 

In terms of the other performance criteria, 
figure 2 shows a summary of key efficiency para- 
meters related to affordability. Although nei- 
ther the historical nor the current automated 
technologies were engineered nor Implemented for 
an operational environment (substantial overhead 
Is necessary In an experimental mode for record- 
ing of Intermediate output), the relative Impro- 
vements exhibited by the automated technologies 
Is obvious. The timeliness In the growing season 
Is some 30 days prior to harvest. 

In summary, this first-time evaluation of 
highly automated spring small grains area estima- 
tion technologies was very encouraging. There 
was definite Improvement In near-harvest estima- 
tion efficiency with modest losses In accuracy as 
compared to the best previous analyst-intensive 
approaches. For the I-year, one-state (1978 N.O.) 
aggregation, no glaring deficiencies were noted. 
The attractive characteristics of low processing 
cost, objectivity, repeatability, modularity, and 
adaptability packaged within an automated frame- 
work make the outlook for meaningful advancements 
very optimistic. 
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Figure 4.- Accuracy conparlsons of spring small grains techniques. 
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B. SUMMER CROPS. CORN. AND SOYBEANS EXPERIMENT^ 

This expurlMfit In tmmr crops, com ond 
soybeans proportion estlaatlon tachnolooy rtprt* 
stnts our first attofspt to take advantaBo of an 
Inprovod systems approach to research and allowed 
the effective utilization of this environment for 
more Iterations of development, testlnp. and 
feedback to research than heretofore achievable 
In a given time (such as 6 months). 

Figure 5 shows accuracy results In terms of 
mean error and 90 percent confidence limits for 
the previously described crop year (1978-79) 
data. 



raocEDuat 


Figure 5.- Comparison of corn and soybean 
baseline subsystem with previous results. 

The following are some addi tonal observations: 

1. There Is a very accurate estimate of crop 
group (summer crops) with RNE less than 

2 percent in both years. 

2. For crop type (corn and soybeans) there 
was a significant bias (com RME>15 per- 
cent. soybeans* 19 percent) In 1 year 
(1978). 

3. The crop type bias for the other year 
(1979) was not significant (corn RMF.*$ 
percent. soybeans*3 percent). 

4. The direction of the bias for crop type 
appears to be consistent between years 
(corn tends to be overestimated, soybeans 
tends to be underestimated). 


B. The standard deviations for all estimates 
are relatively consistent and comparable 
to those achieved In previous "best" 
spring small grains technologies. 

6. Compared to a pmvious procedure for the 
1978 crop years. CS-1 exhibited a signi- 
ficantly lower bias In estimating crop 
group (summer crop RNE -l.S vs. 

•16.4 percent) with lower standard (Avia- 
tions for both crop groups and crop 
types. 

In terms of the other performance criteria the 
following was observed: 

1. Timeliness - processing to crop type Is 
achievable after com tassel Ing which is 
30-45 days prior to harvest. This is 
about the middle of August In the Corn 
Belt. 

2. The processability rates with the CS-1 
technology were quite high, typically 
50 percent to 75 percent. 

Quick Identification of subcomponent 
contributors was achieved. These specific 
results have led to the earller-than-planned 
development of a more-automated summer crop, 
corn, and soybeans proportion estimation pro- 
cedure. The process of making the tevelopmental 
modifications for a new version (CS-IA). con- 
ducting a verification test, then designing and 
Implementing a semiautomatic version (CS-IB) has 
a1rea<tr been accoapllshed. Result* of early ver- 
ification testing over a sample (10 segments) of 
the 1978:79 data set show excellent summer crop 
accuracy’ and corn and soybeans accuracy of about 
10 percent relative mean error and standard devi- 
ations of 4 to 6 percent. 

The rifsults of this experiment are very 
encouraging because the develomental time frame 
Including procedure development, testing. Identi- 
fication of limitations, and procedure Improve- 
ment was accomplished In about one-fifth the time 
of previous research In this area. Additional 
Improvements are expected In this technology. 


V. SIGNIFICANCE OF. RESULTS 


The significance of the present results Is 
threefold: 

1. Me were able to adequately model the sub- 
jective human analyst with an objective 
process and achieve reasonable 
accuracies. 

2. ke were thereby able to develop an In- 
formation extraction technology which was 
not prohibitively costly either In terms 
of manual effort or computational re- 
sources. It Is affordable within a rea- 
sonable standard. 
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3. If th« rtsuUs rtporttd h«rt could b« 
achieved for foreign reg1i;ns, lubitantlal 
Improvements In global crop Information 
would result. 


This should not suggest that the non«U.S. Inven* 
tory and monitoring problem Is solved as several 
key research problems remain. Briefly, the most 
critical of these are; 


1. Improving the automated selection of 
acquisitions. 

2. Finding methods to estimate crop areas 
much earlier in the season. 

3. Finding Information extraction method* 
ol ogles which reduce the quantity and 
quality of the data required, thus 
reducing data costs. 

4. Finding methods for adapting these objec* 
tive analysis methods to other crops and 
regions without requiring Information 
which Is not available In non-U. S. situ- 
ations. 

5. Early quantification of the benefits of 
Improved performance of this technology 
(bje to the features offered by the The- 
matic Mapper. 

6. Doing adequate testing and evaluation to 
understand the technology performance. 


He feel that the present results represent a real 
breakthrough both In approach and practicality. 
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